Abstract

In this paper we investigate the existence of $L^2(\pi)$-spectral gaps for
$\pi$-irreducible, positive recurrent Markov chains on general state space.
We obtain necessary and sufficient conditions for the existence of
$L^2(\pi)$-spectral gaps in terms of a sequence of isoperimetric constants and
establish their asymptotic behavior. It turns out that in some cases the
spectral gap can be understood in terms of convergence of an induced
probability flow to the uniform flow. The obtained theorems can be interpreted
as mixing results and yield sharp estimates for the spectral gap of some
Markov chains.



1 Introduction 

Throughout this paper we consider a time-homogeneous, discrete-time, positive
recurrent Markov chain $\xi_1, \xi_2, \ldots$ on an arbitrary state space
$(\Omega, \mathcal{F})$ with transition kernel $p(\cdot,\cdot)$ and uniquely
determined invariant measure $\pi$. It is assumed that the $\sigma$-algebra
$\mathcal{F}$ is countably generated. The main result of this paper is the
precise characterization of the size of the spectral gap in terms of a
sequence of isoperimetric constants for normal Markov operators. This
characterization has many consequences; for example, it will be applied to
obtain sharp upper bounds for the $L^2(\pi)$-spectral gap for the
M/M/1-queue, the hypercube and the star. Moreover, the theorem is of
theoretical value. On the one hand it is used to generalize a result due to
Roberts and Tweedie [ID] concerning the connection between geometric
ergodicity and the spectral gap property (SGP). On the other hand, it is
applied to obtain a functional analytic inequality between the Banach space of
complex valued measurable functions
$B(\Omega, \mathcal{F}, \|\cdot\|_\infty)$ and the Hilbert space $L^2$.
The paper is organized as follows: First we introduce the main notions and
notations needed throughout this paper, especially a certain sequence of
isoperimetric constants, which is used to provide a classification of those
Markov chains having the (SGP) (section 2). In order to obtain more precise
information about the size of the spectral gap, we restrict the class of
considered Markov chains to those corresponding essentially to normal Markov
operators on $L^2$. Here we state and prove our main results (section 3).
These results are applied in section 4. Finally, it is shown how the results
fit into the theory of mixing for Markov chains (section 5). The proofs that
have not been presented up to that point can be found in section 6.

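We say that $P$ has an $L^2(\pi)$-spectral gap if

\[ \lim_{n\to\infty}\ \sup_{f \in L^2_{0,1}(\pi)} \|P^n f\|_2^{1/n} =: \rho < 1, \]

where

\[ Pf(x) := \int_\Omega f(y)\,p(x,dy), \quad f \in L^2(\pi), \]

and $L^2_{0,1}(\pi) := \{f \in L^2(\pi) : \int_\Omega f(x)\,\pi(dx) = 0,\ \int_\Omega f^2(x)\,\pi(dx) = 1\}$. The
spectral gap is defined to be $1-\rho$. Let $L^2_0(\pi) := \{f \in L^2(\pi) : \int_\Omega f(x)\,\pi(dx) = 0\}$.
If not stated otherwise, $P$ is regarded as an operator on the Hilbert space $L^2_0(\pi)$.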
For reversible Markov chains, i.e.

\[ \pi(dx)\,p(x,dy) = \pi(dy)\,p(y,dx), \]

there is another useful description for $\rho$, namely

\[ \|P\|_{L^2_0(\pi)} = \sup_{\lambda\in\sigma(P)} \{|\lambda| : \lambda \ne 1\} = \rho = \sup_{f\in L^2_{0,1}(\pi)} |\langle f, Pf\rangle_\pi|. \tag{1} \]

The right-hand side of equation (1) is known as the Rayleigh-Ritz principle
[IB]. Note that for reversible Markov chains the associated Markov operator $P$
is selfadjoint and that the spectrum $\sigma(P)$ is always contained in $[-1, 1]$.

Definition 1 A Markov chain $\xi_1, \xi_2, \ldots$ is called $\pi$-a.s. geometrically ergodic
if there exist $\delta < 1$, $C_x > 0$ such that

\[ \|p^n(x,\cdot) - \pi(\cdot)\|_{TV} \le C_x\,\delta^n \quad \pi\text{-a.s.}, \tag{2} \]

where $\|\cdot\|_{TV}$ denotes the total variation norm. In the following we will assume
that $\delta$ is chosen optimally, i.e. $\delta$ is as small as possible while satisfying (2). This
$\delta$ will be called the optimal rate of geometric convergence (ORGC). We have
the following alternative characterization of geometric ergodicity, which can be
found in [TO] or [H], Theorem 6.14 (iii).

Theorem 1 Let $\xi_1, \xi_2, \ldots$ be an ergodic Markov chain on a probability space
with countably generated $\sigma$-algebra $\mathcal{F}$. Then we have the following equivalence:

1. \[ \exists\,\delta < 1,\ C_x > 0:\quad \|p^n(x,\cdot) - \pi(\cdot)\|_{TV} \le C_x\,\delta^n \quad \pi\text{-a.s.} \tag{3} \]

2. \[ \exists\,\delta < 1,\ C > 0 \text{ such that}\quad \big\|\,\|p^n(\cdot,\cdot) - \pi\|_{TV}\,\big\|_1 \le C\,\delta^n. \tag{4} \]
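To make the rate $\delta$ in (2) and (3) concrete, the following minimal Python sketch computes $\|p^n(x,\cdot)-\pi\|_{TV}$ for a small finite chain and estimates the geometric rate from successive ratios. The two-state matrix `P`, its invariant distribution `PI` and the helper names are illustrative assumptions, not taken from the paper; the total variation norm is taken as the full $\ell^1$ distance, so that it is bounded by 2.

```python
def step(row, p):
    """One step of the chain: row vector times transition matrix."""
    size = len(p)
    return [sum(row[x] * p[x][y] for x in range(size)) for y in range(size)]


def tv_distances(p, pi, start, n_max):
    """Return [||p^n(start, .) - pi||_TV for n = 1..n_max].

    The total variation norm is taken as the full l1 distance (values in [0, 2]).
    """
    row = [1.0 if x == start else 0.0 for x in range(len(p))]
    distances = []
    for _ in range(n_max):
        row = step(row, p)
        distances.append(sum(abs(r - q) for r, q in zip(row, pi)))
    return distances


# Hypothetical two-state example (an assumption for illustration only).
P = [[0.9, 0.1],
     [0.2, 0.8]]
PI = [2.0 / 3.0, 1.0 / 3.0]  # invariant distribution of P

d = tv_distances(P, PI, start=0, n_max=10)
ratios = [d[i + 1] / d[i] for i in range(len(d) - 1)]
print(ratios)  # successive ratios estimate the geometric rate delta
```

For this particular chain the ratios are constant and equal to the second eigenvalue 0.7 of `P`, so (2) holds with $\delta = 0.7$.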

There is a close relation between the (SGP) and convergence in the total
variation norm. Doeblin [TT] and Doob [T^ had shown that the convergence in
(2) holds uniformly in $x$ (i.e. $C(x) = C\ \forall x \in \Omega$) if and only if

\[ \limsup_{n\to\infty}\ \sup_{f\in B_\infty,\ \|f\|_\infty = 1} \|(P^n - P_1) f\|_\infty < 1, \]

where $B_\infty$ denotes the space of all bounded, $\mathcal{F}$-measurable complex valued
functions and $P_1(f) = E_\pi f = \int_\Omega f(y)\,\pi(dy)$. Using the Riesz-Thorin interpolation
theorem one can show that this implies the existence of an $L^2(\pi)$-spectral gap.
Actually, we will obtain this result with some additional refinements as a
consequence of the theory developed in this paper.

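The following families of isoperimetric constants will play a crucial role
throughout this work:

\[ k_n := \inf_{A\in\mathcal{F}} k_n(A), \qquad k_n(A) := \frac{1}{\pi(A)\pi(A^c)} \int_A p^n(x,A^c)\,\pi(dx), \quad n \in \mathbb{N}, \]

and

\[ k_{P^{*n}P^n} := \inf_{A\in\mathcal{F}} k_{P^{*n}P^n}(A), \qquad k_{P^{*n}P^n}(A) := \frac{1}{\pi(A)\pi(A^c)} \int_A P^{*n}P^n 1_{A^c}(x)\,\pi(dx), \]

where $P^*$ is the adjoint operator of $P$ considered on $L^2(\pi)$.

In [26] it was shown that the following sequence of constants can be seen as an
appropriate way to measure the periodic behavior of a Markov chain $\xi_1, \xi_{n+1}, \ldots$:

\[ K_n := \sup_{A\in\mathcal{F}} k_n(A), \quad n \in \mathbb{N}. \]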
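For a chain on a finite state space the constants $k_n$ and $K_n$ can be evaluated by brute force over all proper subsets $A$. The following Python sketch assumes the reading $k_n(A) = \frac{1}{\pi(A)\pi(A^c)}\sum_{x\in A}\pi(x)\,p^n(x,A^c)$ of the definition above; the example matrix `P`, its invariant distribution `PI` and all function names are hypothetical and only serve as illustration.

```python
from itertools import combinations


def mat_mul(a, b):
    """Product of two square matrices given as lists of rows."""
    size = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(size)) for j in range(size)]
            for i in range(size)]


def mat_pow(p, n):
    """n-step transition matrix p^n."""
    size = len(p)
    result = [[1.0 if i == j else 0.0 for j in range(size)] for i in range(size)]
    for _ in range(n):
        result = mat_mul(result, p)
    return result


def k_n_of_set(p, pi, n, subset):
    """k_n(A): normalised n-step probability flow from A to its complement."""
    pn = mat_pow(p, n)
    inside = set(subset)
    outside = [y for y in range(len(p)) if y not in inside]
    pi_a = sum(pi[x] for x in inside)
    flow = sum(pi[x] * sum(pn[x][y] for y in outside) for x in inside)
    return flow / (pi_a * (1.0 - pi_a))


def proper_subsets(size):
    """All non-empty proper subsets of {0, ..., size - 1}."""
    for r in range(1, size):
        yield from combinations(range(size), r)


def k_n(p, pi, n):
    """k_n = inf over A of k_n(A)."""
    return min(k_n_of_set(p, pi, n, s) for s in proper_subsets(len(p)))


def K_n(p, pi, n):
    """K_n = sup over A of k_n(A)."""
    return max(k_n_of_set(p, pi, n, s) for s in proper_subsets(len(p)))


# Hypothetical two-state chain with invariant distribution (2/3, 1/3).
P = [[0.9, 0.1],
     [0.2, 0.8]]
PI = [2.0 / 3.0, 1.0 / 3.0]
print(k_n(P, PI, 1), k_n(P, PI, 2))
```

For this two-state chain one has $k_n = K_n = 1 - 0.7^n$, so the printed values are close to 0.3 and 0.51. The brute-force search is exponential in the number of states and is only meant for very small examples.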

2 Bounds for the spectral gap in terms of isoperimetric constants in the general case

Let us present an upper bound for the spectral radius of $P$ in terms of the
isoperimetric quantities $k_n$ and $k_{P^{*n}P^n}$.

Proposition 1 Let $\xi_1, \xi_2, \ldots$ be a stationary Markov chain with invariant
measure $\pi$, transition kernel $p(\cdot,\cdot)$ and state space $(\Omega, \mathcal{F})$. Then the condition

\[ 0 < \inf_{n\in\mathbb{N}} k_n = \inf_{n\in\mathbb{N}}\ \inf_{A\in\mathcal{F}} k_n(A) \tag{5} \]

is necessary for the (SGP). More precisely, we obtain the following estimate:

\[ \|P^n\|_2^2 \ge \sup_{A\in\mathcal{F}} \Big( (1 - k_n(A))^2 + 2\pi(A)\pi(A^c)\,\big(1 - k_{P^{*n}P^n}(A)\big) \Big). \tag{6} \]
The necessity of (5) follows immediately from (6), since if (5) is not satisfied,
the right-hand side of (6) is bounded away from zero for all $n \in \mathbb{N}$, and so is
$\|P^n\|_2$, which implies that the (SGP) cannot hold. The other direction of the
proof is postponed to section 6.

It is natural to ask whether condition (5) is also sufficient. In the
following, we will show that this is actually the case. In fact, we will see that an
apparently weaker condition than (5) turns out to be equivalent to the (SGP).
To show this, we will need the following result due to Lawler and Sokal [TB],
which says that

\[ \Re(\sigma(\mathrm{Id} - P)) \ge \frac{\kappa}{8}\,k_1^2, \tag{7} \]

with

\[ \kappa := \inf_{\mathcal{D}}\ \sup_{c\in\mathbb{R}} \frac{\big(E\,|(X+c)^2 - (Y+c)^2|\big)^2}{E\,(X+c)^2} \]

and $\mathcal{D}$ denoting the set of all possible distributions of i.i.d. random variables
$(X,Y)$ with variance 1, and where $\Re(\sigma(\mathrm{Id} - P))$ means the real part of the
spectrum of $\mathrm{Id} - P$.

Additionally, Lawler and Sokal [TB] had shown that $\kappa \ge 1$.

If we apply the spectral mapping theorem (see e.g. [23]) to (7) with the
function $f(z) = 1 - z$, we obtain

\[ \Re(\sigma(P)) \le 1 - \frac{\kappa}{8}\,k_1^2. \]

Moreover,

\[ \Re(\sigma(P^n)) \le 1 - \frac{\kappa}{8}\,k_n^2. \]

Now let us state our first result: 

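Theorem 2 Let $\xi_1, \xi_2, \ldots$ be a Markov chain with state space $(\Omega, \mathcal{F}, \pi)$ and
associated Markov operator $P$. Then the following statements are equivalent:

1. $P$ has an $L^2(\pi)$-spectral gap.

2. \[ M := \Big\{ \varepsilon > 0 :\ k_n > \varepsilon \ \ \forall n < \frac{2\pi}{\arccos(1 - \frac{\kappa}{16}\varepsilon^2)} + 1 \Big\} \ne \emptyset. \tag{9} \]

If condition 2 holds true, we obtain

\[ \sigma(P - P_1) \subset B_{r_0}(0), \qquad r_0 := \inf_{\varepsilon\in M} \Big(1 - \frac{\kappa}{16}\varepsilon^2\Big)^{1 / \left( \left[ \frac{2\pi}{\arccos(1 - \frac{\kappa}{16}\varepsilon^2)} \right] + 1 \right)}, \tag{10} \]

where $B_r(0)$ is the ball of radius $r$ with center 0.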

The necessity of (9) follows immediately from Proposition 1, so the interesting
part of the theorem is the sufficiency. The theorem says that it is enough to
check $k_n > \varepsilon$ for only finitely many $n$ in order to establish the (SGP), where
this number again depends on $\varepsilon$.

Let us study the asymptotic behavior of (9) and (10) for $\varepsilon \to 0$. First, note that

\[ \frac{2\pi}{\arccos(1 - \frac{\kappa}{16}\varepsilon^2)} \sim \frac{8\pi}{\sqrt{2\kappa}}\,\frac{1}{\varepsilon} \tag{11} \]

(in the sense $a_\varepsilon \sim b_\varepsilon \iff \lim_{\varepsilon\to 0} a_\varepsilon / b_\varepsilon = 1$). This means that the number of $n$
with $k_n > \varepsilon$ that we have to check in order to apply Theorem 2 is inversely
proportional to $\varepsilon$.

Now we will show that if we do not add further assumptions, this is in a
certain sense the weakest sufficient condition (and, as shown, necessary) for the
existence of an $L^2(\pi)$-spectral gap in terms of the isoperimetric constants $k_n$. To
see this, let $p$ be a prime number and consider the deterministic walk on the
circle $\mathbb{Z}/p\mathbb{Z}$, i.e.

\[ \mathbb{Z}/p\mathbb{Z} \ni x \mapsto x + 1 \in \mathbb{Z}/p\mathbb{Z}. \]

It is readily seen that this Markov chain is positive recurrent. The invariant
distribution is easily seen to be the uniform distribution. Furthermore,
one can check that

\[ k_i \ge \frac{1}{p} \quad \forall i < p = \frac{1}{1/p}, \]

but

\[ k_p = 0. \]

This holds true for all prime numbers $p \in \mathbb{N}$. So if we set $\varepsilon = \frac{1}{p}$, we see that
constant $\cdot\,\frac{1}{\varepsilon}$, as suggested in Theorem 2, is the right order of magnitude of the
number of $k_i$'s to check, and this cannot be improved. What certainly can be
improved is the constant appearing in (11), and it would be interesting to do
this, because it would yield sharper bounds for the spectral gap of the Markov chain.
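The behavior of the $k_n$ for the deterministic walk can be checked directly for a small prime. The Python sketch below assumes that the walk is the shift $x \mapsto x+1$ on $\mathbb{Z}/p\mathbb{Z}$ with uniform invariant distribution and evaluates $k_n$ by brute force over all proper subsets; the function name `k_n_cycle` and the choice $p = 5$ are illustrative assumptions.

```python
from itertools import combinations


def k_n_cycle(p, n):
    """k_n for the deterministic shift x -> x + 1 on Z/pZ (uniform pi).

    For a set A with r elements, the n-step flow out of A is the number of
    x in A with x + n (mod p) outside A, divided by p.
    """
    best = None
    for r in range(1, p):
        for subset in combinations(range(p), r):
            inside = set(subset)
            leaving = sum(1 for x in inside if (x + n) % p not in inside)
            value = (leaving / p) / ((r / p) * (1.0 - r / p))
            best = value if best is None else min(best, value)
    return best


p = 5
values = {n: k_n_cycle(p, n) for n in range(1, p + 1)}
print(values)
```

For $p = 5$ the constants $k_1, \ldots, k_4$ are all at least $1/p$, while $k_5 = 0$ because the shift by $p$ is the identity. This is exactly the situation in which finitely many positive $k_n$ do not yet guarantee a spectral gap.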

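Let us refine the asymptotic results obtained in (11). Using Taylor expansion,
the right-hand side in (10) can be written as

\[ \exp\Big( -\frac{\kappa\sqrt{2\kappa}}{128\pi}\,\varepsilon^3 + O(\varepsilon^4) \Big). \]

From these calculations, we obtain

Corollary 1 Assume that the Markov chain $\xi_1, \xi_2, \ldots$ has the (SGP). Then the
spectral gap $1 - \rho$ can be estimated by

\[ 1 - \rho \ge \frac{\kappa\sqrt{2\kappa}}{128\pi}\,k_-^3 - O(k_-^4), \tag{12} \]

where $k_- := \min\{k_n,\ n \in \mathbb{N}\}$.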
Corollary 1 may be interesting for Markov chains with very small spectral gaps.
If it is possible to determine $n_0$ with $k_{n_0} = k_-$, especially if $k_1 = k_-$, Corollary 1
can be used to estimate the spectral gap $1 - \rho$ from below. For example,
if $P$ is a positive and self-adjoint operator, using the spectral representation
theorem for $P$ and the technique presented in the next section one can show
that $k_1 = k_-$. Unfortunately, this case is not very interesting, because Lawler
and Sokal's result can be directly applied. Intuitively, monotonicity of the $k_n$
is closely related to the property that the Markov chain has at most a "weak
cyclic behavior". In general it seems to be difficult to present conditions which
ensure the monotonicity of the $k_n$. Even for reversible and not positive chains
we can give only a partially satisfactory answer: in this case it can be
established that $k_{2n}$ is monotonically increasing in $n$. The general problems arise
from the difficulty of obtaining precise information about the spectral measures
associated with the family of indicator functions.

Now let us show how Theorem 2 can be applied to prove sufficient conditions
for the spectral gap property.

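Corollary 2 Assume that the Markov chain $\xi_1, \xi_2, \ldots$ satisfies (2) with
$C(x) = C$ independent of $x$. Then $P$ has an $L^2(\pi)$-spectral gap.

Proof:

\[ k_n(A) = \frac{1}{\pi(A)\pi(A^c)} \int_A p^n(x,A^c)\,\pi(dx) \ge 1 - \frac{1}{\pi(A)\pi(A^c)} \int_A |p^n(x,A^c) - \pi(A^c)|\,\pi(dx) \ge 1 - 2C\delta^n \ge \frac{1}{2}, \quad n \ge n_0,\ n_0 \text{ sufficiently large}, \tag{13} \]

where $\delta$ is given in (2).

In order to complete the proof, we need

Lemma 1 Assume that there exist $n_0 \in \mathbb{N}$, $\varepsilon > 0$ such that $k_n \ge \varepsilon\ \forall n \ge n_0$.
Then we have:

\[ k_i \ge \frac{\varepsilon}{n_0} \quad \forall i \in \mathbb{N}. \]

With Lemma 1 it follows that

\[ k_n \ge \frac{1}{2 n_0} \quad \forall n \in \mathbb{N}. \]

The claim follows by applying Theorem 2.

□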

Since the assumption in Corollary 2 is equivalent to the classical Doeblin
condition (see [17]), we have just shown the well-known fact that the classical
Doeblin condition implies the existence of an $L^2(\pi)$-spectral gap. Later we
will improve this result by comparing rates of convergence on different Banach
spaces.

Moreover, from Theorem 2 we easily derive the following sufficient condition
for the existence of an $L^2(\pi)$-spectral gap:

Corollary 3 Assume that condition ^ is fulfilled. Moreover, assume that 
there exists e > such that 

limsup sup I 1^^%^ - l\Adx) < 1 - e. (14) 

Then $P$ has an $L^2(\pi)$-spectral gap.
Proof: It is easy to show that

> |. (15) 
Now the claim follows from Theorem 2 and Lemma 1.

□ 

Until now, our approach provides only little information about the size of
the spectral gap. In order to obtain more precise estimates for the spectral gap,
we should analyze the asymptotic behavior of $1 - k_n$ and $1 - k_{P^{*n}P^n}$.



3 Convergence rates of $k_n$ and $k_{P^{*n}P^n}$

In this section we will present the main theorem of this paper, namely the fact
that for normal Markov operators we get a precise formula for the spectral
gap of $P$ in terms of $1 - k_{P^{*n}P^n}$. This result is surprising and has important
consequences, as will be shown later on.

In order to deduce the interesting properties of $k_n$ and $k_{P^{*n}P^n}$, we should
mention the following elementary but important

Lemma 2 The sequence $k_{P^{*n}P^n}(A)$ is monotonically increasing in $n$ and we have

\[ k_{P^{*n}P^n}(A) \le 1 \quad \forall n \in \mathbb{N},\ A \in \mathcal{F}. \tag{16} \]

Let us return to Proposition 1. From (6) it can be immediately obtained
that

\[ \rho := \lim_{n\to\infty} \|P^n\|_2^{1/n} \ge \lim_{n\to\infty} \max(1 - k_n,\ K_n - 1)^{1/n}. \tag{17} \]

So the (SGP) implies the geometric convergence of $K_n$ and $k_n$ to 1, which is
equivalent to the geometric convergence of $K_n - k_n$ to zero. Indeed, it turns out
that this rate of convergence to zero is closely related to the spectral radius $\rho(P)$.
To see this, first note that (17) is based on (6), which sometimes turns out to be
not as good as possible from an asymptotic point of view. The following equality,
which can be derived by similar calculations as in the proof of Proposition 1,
indicates that for investigating the spectral properties of $P$, one should have a
closer look at the isoperimetric constants $k_{P^{*n}P^n}$:

\[ \|P^n f_0\|_2 = \sqrt{1 - k_{P^{*n}P^n}(A)} \tag{18} \]

with $f_0$ as in (69), so that

\[ \rho \ge \lim_{n\to\infty} \Big( \sqrt{1 - k_{P^{*n}P^n}} \Big)^{1/n}. \tag{19} \]

Note that this inequality is a lower bound for the spectral radius of $P$ and hence
yields an upper bound for the spectral gap. We will see that for normal
operators equality actually holds in (19). Before we prove this, let us
compare $k_n$ to $k_{P^{*n}P^n}$.

Since we have equality in (18) and the estimate in (17) is also derived by
considering $\|P^n f_0\|_2$, the right-hand side in (19) must be at least as large as the
right-hand side in (17). In general, we have the following relationship between
$k_n$ and $k_{P^{*n}P^n}$:

Lemma 3 Let $\xi_1, \xi_2, \ldots$ be a Markov chain with invariant measure $\pi$. Then we
have the following inequality

\[ 1 - k_n \le \sqrt{2}\,\sqrt{1 - k_{P^{*n}P^n}}. \tag{20} \]
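Both the identity (18) and the set-wise form of the inequality in Lemma 3 can be checked numerically on a small chain. The Python sketch below is written under two assumptions: that $f_0 = \sqrt{\pi(A)\pi(A^c)}\,\big(1_A/\pi(A) - 1_{A^c}/\pi(A^c)\big)$, and that $k_{P^{*n}P^n}(A)$ can be evaluated as $\frac{1}{\pi(A)\pi(A^c)}\sum_x \pi(x)\,p^n(x,A)\,p^n(x,A^c)$, which follows from $\langle P^{*n}P^n 1_{A^c}, 1_A\rangle_\pi = \langle P^n 1_{A^c}, P^n 1_A\rangle_\pi$. The doubly stochastic, non-reversible example matrix and all names are hypothetical.

```python
from itertools import combinations
from math import sqrt


def apply_n(p, f, n):
    """Return P^n f, where (P f)(x) = sum_y p(x, y) f(y)."""
    size = len(p)
    for _ in range(n):
        f = [sum(p[x][y] * f[y] for y in range(size)) for x in range(size)]
    return f


def report(p, pi, n, subset):
    """Return (||P^n f_0||_2, sqrt(1 - k_{P*^n P^n}(A)), 1 - k_n(A)) for A = subset."""
    size = len(p)
    inside = set(subset)
    pi_a = sum(pi[x] for x in inside)
    pi_c = 1.0 - pi_a
    scale = sqrt(pi_a * pi_c)
    f0 = [scale / pi_a if x in inside else -scale / pi_c for x in range(size)]
    pnf = apply_n(p, f0, n)
    norm = sqrt(sum(pi[x] * pnf[x] ** 2 for x in range(size)))
    indicator_a = [1.0 if x in inside else 0.0 for x in range(size)]
    u = apply_n(p, indicator_a, n)  # u[x] = p^n(x, A)
    k_pp = sum(pi[x] * u[x] * (1.0 - u[x]) for x in range(size)) / (pi_a * pi_c)
    k_n_a = sum(pi[x] * (1.0 - u[x]) for x in inside) / (pi_a * pi_c)
    return norm, sqrt(max(0.0, 1.0 - k_pp)), 1.0 - k_n_a


# Hypothetical circulant chain; doubly stochastic, so pi is uniform.
P = [[0.5, 0.3, 0.2],
     [0.2, 0.5, 0.3],
     [0.3, 0.2, 0.5]]
PI = [1.0 / 3.0] * 3

results = [report(P, PI, n, s)
           for n in (1, 2, 3)
           for r in (1, 2)
           for s in combinations(range(3), r)]
for norm, root, gap in results:
    print(norm, root, gap)
```

In each printed row the first two numbers agree, as (18) predicts, and the third number does not exceed $\sqrt{2}$ times the second.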

Having established an upper bound, it would be nice to achieve a lower
bound for the spectral gap in terms of isoperimetric constants. For this reason
let us consider an arbitrary indicator function $g$ in $L^2_{0,1}(\pi)$, i.e.



I 

i=l 



where $A_i \in \mathcal{F}$ such that $\bigcup_{i=1}^{l_g} A_i = \Omega$, $A_i \cap A_j = \emptyset$ for $i \ne j$, $i, j \in \{1, \ldots, l_g\}$.
Then we have



Jensen f --A- 1 /" o 

Jn ~{ T^yAi) Jn 
^ ^^(A^)(l-fcp..p„(AO)<?,(l-fcp."p.). (21) 
So for all indicator functions $g$ we obtain

\[ \lim_{n\to\infty} \|P^n g\|_2^{1/n} \le \lim_{n\to\infty} \Big( \sqrt{1 - k_{P^{*n}P^n}} \Big)^{1/n}. \tag{22} \]

Note that the convergence in (22) is not uniform with respect to the class of
step functions, so we cannot pass to the limit in order to obtain this estimate
for all $f \in L^2_{0,1}(\pi)$. There are mainly two reasonable ways to continue. First,
one could consider only those functions $f \in L^2_{0,1}(\pi)$ which can be approximated
sufficiently fast by step functions in a certain sense. For the second way, which we
will follow here, one can put some additional assumptions on the operator $P$.
For this reason let us define

Definition 2 We call a sequence of operators $(A_n)_{n\in\mathbb{N}}$ positive and quasi
selfadjoint generated, if there exist a positive and selfadjoint operator $Q$ and constants
$C > 0$, $q < 1$ such that for all indicator functions $g \in L^2_{0,1}(\pi)$

\[ \|(A_n - Q^n) g\|_2 \le C q^n \quad \forall n \in \mathbb{N}. \tag{23} \]

Now let us state the main theorem of this paper: 

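Theorem 3 Assume that the sequence $(P^{*n}P^n)_{n\in\mathbb{N}}$ associated to the Markov
chain $\xi_1, \xi_2, \ldots$ is quasi selfadjoint. Then we have for all $f \in L^2_{0,1}(\pi)$

\[ \lim_{n\to\infty} \|P^n f\|_2^{1/n} \le \max\Big( \lim_{n\to\infty} \Big( \sqrt{1 - k_{P^{*n}P^n}} \Big)^{1/n},\ \sqrt{q} \Big), \tag{24} \]

where $q$ is as in (23).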

Proof: For the proof we need the spectral theorem for selfadjoint operators
(see e.g. [23]). For all $g \in L^2_{0,1}(\pi)$ we have

- (Q"5,5>^ + {{P*"P" - Q")ff,5>. (25) 
Since Q is selfadjoint, we obtain 

(Q"5,5>. = / X^{dE^g,g)^ 
J'y{Q) 

> If \''{dE>^g,g)A ={Q''g,g)l. (26) 
\J<yiQ) J 

This together with (gH]) and (HT)) yields 

(Q'5,5>. < (Q".9,.9>l < ((P*'>".9,.g>. + |((P*"P" - g")g,5).l) " 

k_ 

< max (2(P*"P"g,g).,2|((P*'>" - Q").g,. g>^l) " 

< max('(2?,(l-fcp.np„))" ,(2||(P*>"-Q")5ll2)") (27) 
Now with $n \to \infty$ we obtain for all $k \in \mathbb{N}$

(Q'=ff,5)- < max( lim (1 - fcp."p„) " , g*'') . (28) 

At this point note the small but crucial difference between (28) and (21). By
taking advantage of the reversible structure of $Q$, we get rid of the term $l_g$.
Now the result follows since $f \in L^2_{0,1}(\pi)$ can be approximated arbitrarily well by
indicator functions $g \in L^2_{0,1}(\pi)$.

□ 

As an immediate consequence we obtain 

Corollary 4 Assume that the operator $P$ associated to the Markov chain $\xi_1, \xi_2, \ldots$
is normal. Then the spectral radius $\rho$ is given by

\[ \rho = \lim_{n\to\infty} \Big( \sqrt{1 - k_{P^{*n}P^n}} \Big)^{1/n}. \tag{29} \]

Especially, for reversible Markov chains this yields

\[ \rho = \lim_{n\to\infty} (1 - k_{2n})^{1/(2n)}. \tag{30} \]

Moreover, if the Markov chain is also positive, we obtain

\[ \rho = \lim_{n\to\infty} (1 - k_n)^{1/n}. \tag{31} \]

Proof: We see that for $Q = P^*P$, (23) is satisfied with $q = 0$. Now the
claim follows from Theorem 3 and (19). The second assertion is trivial, since
$P^*P = P^2$ in the selfadjoint case. The last assertion follows from the fact that
we can carry out the proof of Theorem 3 directly using $P$ instead of $Q$.

□
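The limit formula for reversible chains can be illustrated on a small birth-death chain. The Python sketch below evaluates $(1 - k_{2n})^{1/(2n)}$ by brute force, using $1 - k_{2n} = \max_A (1 - k_{2n}(A))$ with $k_n(A) = \frac{1}{\pi(A)\pi(A^c)}\sum_{x\in A}\pi(x)\,p^n(x,A^c)$. The three-state matrix `P`, its invariant distribution `PI` and the function names are hypothetical choices for illustration; this chain is reversible with eigenvalues $1$, $\frac12$ and $0$.

```python
from itertools import combinations


def apply_n(p, f, n):
    """Return P^n f, where (P f)(x) = sum_y p(x, y) f(y)."""
    size = len(p)
    for _ in range(n):
        f = [sum(p[x][y] * f[y] for y in range(size)) for x in range(size)]
    return f


def one_minus_k(p, pi, n, subset):
    """1 - k_n(A) for A = subset."""
    size = len(p)
    inside = set(subset)
    pi_a = sum(pi[x] for x in inside)
    indicator_c = [0.0 if x in inside else 1.0 for x in range(size)]
    v = apply_n(p, indicator_c, n)  # v[x] = p^n(x, A^c)
    flow = sum(pi[x] * v[x] for x in inside)
    return 1.0 - flow / (pi_a * (1.0 - pi_a))


def radius_estimate(p, pi, n):
    """(1 - k_{2n})^{1/(2n)}, with 1 - k_{2n} taken as the maximum over all A."""
    size = len(p)
    worst = max(one_minus_k(p, pi, 2 * n, s)
                for r in range(1, size)
                for s in combinations(range(size), r))
    return worst ** (1.0 / (2 * n))


# Hypothetical reversible birth-death chain on three states.
P = [[0.5, 0.5, 0.0],
     [0.25, 0.5, 0.25],
     [0.0, 0.5, 0.5]]
PI = [0.25, 0.5, 0.25]

estimates = [radius_estimate(P, PI, n) for n in (1, 5, 10)]
print(estimates)
```

For this chain the estimates increase towards the spectral radius $\rho = \frac12$ from below. Even powers are used because, for a reversible chain that is not positive, $1 - k_n(A)$ may be negative for odd $n$.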

Let us interpret these results in terms of spectral measures: Denote by
$\mu_f(d\lambda) = d\langle E_\lambda f, f\rangle$ the spectral measure associated with the Markov operator $P$
and $f \in L^2_{0,1}(\pi)$. Then the Rayleigh-Ritz principle for reversible Markov chains implies
that there exists a sequence of functions $(f_i)_{i\in\mathbb{N}} \subset L^2_{0,1}(\pi)$ such that for all $\varepsilon > 0$
we have

\[ \lim_{i\to\infty} \mu_{f_i}(B_\rho(\varepsilon)) = 1, \tag{32} \]

where $B_\rho(\varepsilon) := \{y \in \mathbb{R} : |y - \rho| < \varepsilon\}$. Let us consider a certain non-linear
subset of $L^2_{0,1}(\pi)$, namely the step functions of the following kind:

\[ f_A = \sqrt{\pi(A)\pi(A^c)} \Big( \frac{1_A}{\pi(A)} - \frac{1_{A^c}}{\pi(A^c)} \Big), \quad A \in \mathcal{F}. \tag{33} \]

It can be easily checked that

\[ E_\pi f_A = 0, \qquad \|f_A\|_2 = 1. \tag{34} \]
Moreover, we have the following identity:

\[ 1 - k_n(A) = \langle f_A, P^n f_A\rangle_\pi = \int_{-1}^{1} \lambda^n\,\mu_{f_A}(d\lambda). \tag{35} \]

Since $\rho = \lim_{n\to\infty}(1 - k_{2n})^{1/(2n)}$, it follows that there exists a sequence $(A_i)_{i\in\mathbb{N}} \subset \mathcal{F}$
such that for all $\varepsilon > 0$ we have that

\[ \lim_{i\to\infty} \mu_{f_{A_i}}(B_\rho(\varepsilon)) > 0. \tag{36} \]

In comparison to the general Rayleigh-Ritz principle, which says that the family
of spectral measures $\mu_f$, $f \in L^2_{0,1}(\pi)$, contains measures that are in a certain
sense arbitrarily close to the Dirac measure at $\rho$, we see that the family of spectral
measures induced by the family of simple step functions defined by (33) contains
for every $\varepsilon > 0$ at least some measures that put a positive amount of mass on
the ball $B_\rho(\varepsilon)$.



4 Applications 

In this section we will present some applications of the developed theory. The
first applications should indicate the theoretical value of Theorem 3. Later we
will see how to use Theorem 3 in order to obtain precise bounds for the
M/M/1-queue, the hypercube and the star. Let us start with a partial generalization
of a theorem due to Roberts and Tweedie pU]:

Theorem 4 Assume that $\xi_1, \xi_2, \ldots$ is a geometrically ergodic and normal Markov
chain with (ORGC) $\delta$. Then $\xi_1, \xi_2, \ldots$ has the (SGP) and the spectral radius $\rho$
can be estimated by

\[ \rho \le \delta. \]

Proof: 

From (gni) it follows that 

(l-fc^.,^,(A))^ <(l-fcp..p„(A))^. (37) 
In (76) we will show that

\[ 1 - k_{P^{*n}P^n}(A) = \frac{1}{\pi(A)\pi(A^c)} \int_\Omega \big(p^n(x,A^c) - \pi(A^c)\big)^2\,\pi(dx). \tag{38} \]

Now, using ^ and Theorem [Jit follows that 

1 

(l-fcp.<p,(A))^ < (^--^^^l^^(p»(x,A=)-^(A=))Mrf^))" 

Now first let $n \to \infty$, then take the infimum over all $A \in \mathcal{F}$ and finally let
$l \to \infty$. Now the claim follows from Theorem 3.
□

Note that this result is known for reversible Markov chains [50]. The existing
proof for that case is more difficult and requires results that have appeared
in [HI-

Let us improve the results obtained in Corollary 2.

Corollary 5 Assume that the Markov chain $\xi_1, \xi_2, \ldots$ satisfies (2) with
$C(x) = C$ independent of $x$ and that additionally $P^*$ has a representation of the
form

\[ P^* f(x) := \int_\Omega f(y)\,p^*(x,dy), \tag{40} \]

such that $p^*(x,\cdot)$ is a transition probability kernel. Then $P$ has an $L^2(\pi)$-spectral
gap and the spectral radius $\rho$ can be estimated by $\rho \le \sqrt{\delta}$.

Proof: 

Putting ([2S|) and ([37|) together we obtain for h < h E N that 

/ 1 

< 

~ \tt{A)tt(A'' J 

2 



tt{A)t:{A 

[{P*" Pn^'HU - niA))y {xMdx) 



< 



TT{A)Tr{A^ 



sup lb* (x, •) - ^ll^'e- sup •) - ^H^'f. 

(41) 

Now let ^2 — *■ 00 and take the supremum over all A G ^ we obtain 

(1 - fc(p."p„)2,i)5Tr < 2CS'\ (42) 

Since $P^{*n}P^n$ is selfadjoint and positive, we can apply Corollary 4. For $l_1 \to \infty$
we get

\[ \rho(P^{*n}P^n) \le 2C\delta^n, \tag{43} \]

where $\rho(P^{*n}P^n)$ denotes the spectral radius of $P^{*n}P^n$ on $L^2_0$. Since

\[ \rho(P^{*n}P^n) = \sup_{f\in L^2_{0,1}(\pi)} \langle P^{*n}P^n f, f\rangle_\pi = \|P^n\|_2^2 \tag{44} \]

coincides with the squared operator norm of $P^n$, we finally obtain, by taking
the $2n$-th root and letting $n \to \infty$, that

\[ \rho \le \sqrt{\delta}. \tag{45} \]

□

This result is well-known and can alternatively be obtained by applying the
Riesz-Thorin interpolation theorem (see e.g. f5^) in the following way. By
the Riesz-Thorin theorem it follows that

ll/llLg<ll/ll!s=ll/lltv 

In [24], Proposition 1, the following inequality was shown:

\[ \sup_{x\in\Omega} \|p^{n+1}(x,\cdot) - \pi\|_{TV} \le \|P^n - P_1\|_\infty \le \sup_{x\in\Omega} \|p^n(x,\cdot) - \pi\|_{TV}. \tag{47} \]

Here, $\|\cdot\|_\infty$ denotes the supremum norm over all bounded, measurable complex
valued functions. Now from (46) and (47) we also obtain the result of
Corollary 5. But note that the proof of Corollary 5 actually yields more. In the
proof we used the following trivial estimate:

\[ \sup_{x\in\Omega} \|p^*(x,\cdot) - \pi\|_{TV} \le 2. \]

But if $p^*(x,\cdot)$ is also uniformly ergodic, which is always the case for finite
Markov chains, we immediately obtain

Corollary 6 Let $\xi_1, \xi_2, \ldots$ be a Markov chain as in Corollary 5 and assume
that, additionally, there exist constants $C^* > 0$ and $\delta^* < 1$ such that

\[ \sup_{x\in\Omega} \|p^{*n}(x,\cdot) - \pi\|_{TV} \le C^* (\delta^*)^n. \tag{48} \]

Then

\[ \rho \le \sqrt{\delta\,\delta^*}. \tag{49} \]

Proof: See the proof of Corollary 5.

□

Now the theorems will be applied to obtain explicit bounds for some Markov
chains.

Example 1 (M/M/1-Queue) Let us consider the M/M/1 queueing system
with state space $\mathbb{N}$ and reflecting boundary at one. More precisely, the non-zero
elements of $P = (p_{i,j})_{(i,j)\in\mathbb{N}\times\mathbb{N}}$ are

\[ p_{i,i+1} = q \ \ \forall i \in \mathbb{N}, \qquad p_{i,i-1} = p, \ \ \mathbb{N} \ni i \ge 2, \qquad p_{1,1} = p, \tag{50} \]

where $q := 1 - p$ and $p > \frac{1}{2}$ in order to guarantee that the associated Markov chain is
positive recurrent. This example was studied in the book of Feller ([13], pages 436-438).
From that one can deduce (for example by using that spectral radius = (ORGC)
for reversible Markov chains) that the $L^2(\pi)$-spectral radius $\rho$ of this chain is
given by

\[ \rho = 2\sqrt{pq}. \tag{51} \]

Now we will proceed by showing how to derive this bound by using isoperimetric
constants. For $s \in \mathbb{N}$ let

\[ A_s := \{s, s+1, s+2, \ldots\}. \]

The invariant measure $\pi$ can be easily calculated to be

\[ \pi(i) = \frac{p-q}{p} \Big( \frac{q}{p} \Big)^{i-1}, \quad i \in \mathbb{N}. \]

It follows that for all $s \in \mathbb{N}$ we have

s-l 



This yields 



2 
P. 

oc 

^^^^/"(, + ,,{l,2,...,.-l})(^) . (52) 



(?) 



s-l 



The interesting observation now is that for $s > n$, (52) is decreasing in $s$, and
one might hope that $\lim_{s\to\infty} k_n(A_s) = k_n$. We have

oo / \ * 

lim fc2„(A.) = " Vp'"(s + z,{l,2,...,5-l}) p . (53) 

s-^oo p ^ \pj 

Note that $p^{2n}(s+i, \{1, 2, \ldots, s-1\})$ does not depend on $s$ for $s > n$. Now (53)
implies that

p-q 



P TTo VP 



1- lim A;2„(A,) = ^^Vp'"(s + i,{s,s + l,s + 2,...}) p . (54) 
Since for s > n we have 



"(. + ^, . + 1, . + 2, . . .}) = 5: ( (55) 
Now we apply (54) to (55) and obtain



2n i 



> (57) 



P 

Using Stirling's approximation and applying Corollary 4, one obtains

\[ \rho = \lim_{n\to\infty} (1 - k_{2n})^{1/(2n)} \ge \lim_{n\to\infty} \Big(1 - \lim_{s\to\infty} k_{2n}(A_s)\Big)^{1/(2n)} \ge 2\sqrt{pq}. \tag{58} \]

On the other hand, the right-hand side in (56) can be estimated from above by

^P"V>"I:E f-V"'*' < ^4n^P"V>"- (59) 
P V'^/ ~^~'o\PJ P 

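Again, applying Stirling's formula yields

\[ \lim_{n\to\infty} \Big(1 - \lim_{s\to\infty} k_{2n}(A_s)\Big)^{1/(2n)} \le 2\sqrt{pq}. \tag{60} \]

This together with (58) gives

\[ \lim_{n\to\infty} \Big(1 - \lim_{s\to\infty} k_{2n}(A_s)\Big)^{1/(2n)} = 2\sqrt{pq}. \tag{61} \]

We have just shown that $\rho \ge 2\sqrt{pq}$, and if we also have that

\[ \lim_{n\to\infty} \Big(1 - \lim_{s\to\infty} k_{2n}(A_s)\Big)^{1/(2n)} = \lim_{n\to\infty} (1 - k_{2n})^{1/(2n)}, \tag{62} \]

then $\rho = 2\sqrt{pq}$. Conversely, since we already know that $\rho = 2\sqrt{pq}$, it
follows that (62) holds.

Note that the estimate in (57) is very rough but still suffices to obtain a sharp
lower bound for the spectral radius $\rho$.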
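The lower bound obtained from the tail sets $A_s$ can be explored numerically. The Python sketch below works under explicit assumptions that are not part of the paper: the state space is truncated to $\{1, \ldots, N\}$ with the top state holding with probability $q = 1 - p$, the parameter $p$ is the probability of a downward step, and $s$ is chosen larger than $2n$ and far below $N$ so that the truncation has a negligible effect. The function name and the parameter values are hypothetical.

```python
from math import sqrt


def mm1_tail_estimate(p, n, s, size):
    """(1 - k_{2n}(A_s))^{1/(2n)} for A_s = {s, s+1, ...}.

    The walk is truncated to the states 1..size (an assumption of this
    sketch); list index i stands for state i + 1.
    """
    q = 1.0 - p
    # Invariant weights pi(i) proportional to (q/p)^(i-1), i = 1..size.
    weights = [(q / p) ** i for i in range(size)]
    total = sum(weights)
    pi = [w / total for w in weights]
    # v[i] = probability of being in {1, ..., s-1} after m steps from state i+1.
    v = [1.0 if i + 1 < s else 0.0 for i in range(size)]
    for _ in range(2 * n):
        new = [0.0] * size
        for i in range(size):
            down = v[i - 1] if i > 0 else v[0]        # state 1 holds with probability p
            up = v[i + 1] if i + 1 < size else v[i]   # truncation: top state holds with probability q
            new[i] = p * down + q * up
        v = new
    pi_a = sum(pi[s - 1:])
    flow = sum(pi[i] * v[i] for i in range(s - 1, size))
    k = flow / (pi_a * (1.0 - pi_a))
    return (1.0 - k) ** (1.0 / (2 * n))


p = 0.7
estimate = mm1_tail_estimate(p, n=50, s=150, size=400)
bound = 2.0 * sqrt(p * (1.0 - p))
print(estimate, bound)
```

The estimate stays below $2\sqrt{pq}$ and approaches it only slowly as $n$ grows, since the probability of remaining in $A_s$ carries a polynomial correction factor in $n$.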

Let us recapitulate the procedure. In order to obtain a "good" lower bound
for the spectral radius $\rho$, we choose from our intuition (or our knowledge) a
sequence $(A_s)_{s\in\mathbb{N}}$ for which it is expected that

\[ \lim_{s\to\infty} k_n(A_s) \approx k_n. \tag{63} \]

Next we calculate, if possible, the left-hand side in (63) and obtain a lower
bound. If (63) in fact holds true in a certain sense, the achieved lower bound
for the spectral radius is sharp. Unfortunately, in most cases it is rather
difficult to verify (63), so one depends on a "good" feeling for the right sequence
$(A_s)_{s\in\mathbb{N}}$. Nevertheless, we will give two more examples where the intuitive choice
of $(A_s)_{s\in\mathbb{N}}$ yields sharp lower bounds for the spectral radius $\rho$.
Example 2 (Lazy random walk on hypercube) We will show how we can
apply Theorem 3 to obtain a sharp upper estimate of the spectral gap of the lazy
random walk on the hypercube by analyzing the spectral properties of a much
simpler Markov chain. Let us consider the random walk on the hypercube with
the set $V$ of vertices given by

\[ V := \{x = (x_1, x_2, \ldots, x_n) : x_i \in \{0, 1\},\ i \in \{1, 2, \ldots, n\}\}. \]

The transition probabilities $p(\cdot,\cdot)$ of the random walk are given by

\[ p(x,y) = \begin{cases} \frac{1}{2}, & x = y, \\ \frac{1}{2n}, & \|x - y\|_2 = 1, \\ 0, & \text{else.} \end{cases} \]

Define

\[ A = \{x \in V : x_1 = 0\}. \]

For all $x \in A$, $m \in \mathbb{N}$ we have

\[ p^m(x, A^c) = (1, 0)\,Q^m \begin{pmatrix} 0 \\ 1 \end{pmatrix}, \qquad \text{where } Q = \begin{pmatrix} 1 - \frac{1}{2n} & \frac{1}{2n} \\ \frac{1}{2n} & 1 - \frac{1}{2n} \end{pmatrix}. \]

An easy calculation shows

\[ p^m(x, A^c) = \frac{1}{2} - \frac{1}{2}\Big(1 - \frac{1}{n}\Big)^m. \]

From this we obtain

\[ \lim_{m\to\infty} (1 - k_{2m}(A))^{1/(2m)} = 1 - \frac{1}{n} \]

and therefore

\[ \rho \ge 1 - \frac{1}{n}. \tag{64} \]

In fact, it is known that $\rho = 1 - \frac{1}{n}$, so the bound is sharp.
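The reduction to a two-state chain can be cross-checked against the full hypercube for small $n$. The following Python sketch assumes the lazy walk holds with probability $\frac12$ and flips each coordinate with probability $\frac{1}{2n}$; it evaluates $(1 - k_{2m}(A))^{1/(2m)}$ for $A = \{x : x_1 = 0\}$ on the full chain. The function name is a hypothetical choice for illustration.

```python
from itertools import product


def lazy_hypercube_estimate(n, m):
    """Return (1 - k_{2m}(A))^{1/(2m)} for A = {x : x_1 = 0} on {0,1}^n."""
    vertices = list(product((0, 1), repeat=n))
    index = {v: i for i, v in enumerate(vertices)}
    size = len(vertices)
    # neighbours[i] lists the vertices that differ from vertex i in one coordinate.
    neighbours = []
    for v in vertices:
        flipped = [v[:j] + (1 - v[j],) + v[j + 1:] for j in range(n)]
        neighbours.append([index[w] for w in flipped])
    # u[i] = p^t(x_i, A^c), starting from the indicator of A^c = {x_1 = 1}.
    u = [float(v[0]) for v in vertices]
    for _ in range(2 * m):
        u = [0.5 * u[i] + sum(u[j] for j in neighbours[i]) / (2.0 * n)
             for i in range(size)]
    pi_a = 0.5  # uniform invariant distribution, half of the vertices lie in A
    flow = sum(u[i] / size for i, v in enumerate(vertices) if v[0] == 0)
    k = flow / (pi_a * (1.0 - pi_a))
    return (1.0 - k) ** (1.0 / (2 * m))


print(lazy_hypercube_estimate(3, 3), lazy_hypercube_estimate(4, 5))
```

Because $1 - k_{2m}(A) = (1 - \frac1n)^{2m}$ for this set, the estimate equals $1 - \frac1n$ for every $m$, up to rounding.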

Example 3 (The star) Consider the following Markov chain with state space
$\{1, 2, \ldots, n\}$. Let $a_i \in (0, 1)$, $i \in \{1, 2, \ldots, n\}$, such that

\[ \sum_{i=1}^{n} a_i = 1. \]

Let the transition probabilities be given by

\[ P = \begin{pmatrix} a_1 & a_2 & a_3 & \cdots & a_n \\ 1 & 0 & 0 & \cdots & 0 \\ 1 & 0 & 0 & \cdots & 0 \\ \vdots & \vdots & \vdots & & \vdots \\ 1 & 0 & 0 & \cdots & 0 \end{pmatrix}. \tag{65} \]

The invariant distribution $\pi$ of this Markov chain can be easily verified to be

7t{1)^-^, 7r(z) = i e {2,3,...,n}. 

It can be checked that the chain is reversible, so Theorem 3 can be applied. Since
everything flows out of the set $A := \{2, 3, \ldots, n\}$ in one step, one expects that

\[ \lim_{n\to\infty} (1 - k_{2n}(A))^{1/(2n)} = \lim_{n\to\infty} (1 - k_{2n})^{1/(2n)}. \]

Again, the left hand side can be calculated as in the example before with the 
Matrix 

\ ai 1 — ai 

The second largest eigenvalue of this Matrix equals — oi. This yields 

P > ai. 

Actually, it can be shown that $\rho = a_1$, so the bound is sharp. Again, we have just
seen that Theorem 3 can be used to reduce the complexity of the original chain by
analyzing a much simpler chain. The following observation may be interesting:
The special structure of the star leads to the fact that, independent of the chosen
set $A$, it can be shown that $\lim_{n\to\infty}(1 - k_{2n}(A))^{1/(2n)} = \lim_{n\to\infty}(1 - k_{2n})^{1/(2n)}$.
This is in some sense good and bad. It is good because it shows that even an
arbitrarily chosen set $A$ may yield a sharp lower bound for the spectral radius of
$P - P_1$. It is bad because it shows that it might be impossible to recover all or
most eigenvalues of the Markov chain by computing $\lim_{n\to\infty}(1 - k_{2n}(A))^{1/(2n)}$
for different sets $A$. In this example, the eigenvalue with multiplicity $n - 2$ is
not recognized.



5 Isoperimetric constants and mixing 

This section is included to interpret the obtained results in terms of mixing. Let
$(X_n)_{n\in\mathbb{N}}$ denote a family of random variables on a probability space $(\Omega, \mathcal{F}, P)$
and let $\mathcal{F}_m := \sigma(X_1, X_2, \ldots, X_m)$ and $\mathcal{F}^n := \sigma(X_n, X_{n+1}, \ldots)$. The sequence
$(X_n)_{n\in\mathbb{N}}$ is called strongly mixing ($\alpha$-mixing) if $\alpha(n) \to 0$, where

\[ \alpha(n) := \sup_{k\ge 1}\ \sup_{A\in\mathcal{F}_k,\ B\in\mathcal{F}^{k+n}} |P(A\cap B) - P(A)P(B)|, \]

uniformly mixing ($\phi$-mixing) if $\phi(n) \to 0$ with

\[ \phi(n) := \sup_{k\ge 1}\ \sup_{A\in\mathcal{F}_k,\ B\in\mathcal{F}^{k+n}} \frac{|P(A\cap B) - P(A)P(B)|}{P(A)}, \tag{66} \]

and asymptotically uncorrelated ($\rho$-mixing) if $\rho(n) \to 0$ with

\[ \rho(n) := \sup\{\mathrm{corr}(U, V) :\ U \in L^2(\mathcal{F}_k),\ V \in L^2(\mathcal{F}^{k+n})\}. \]

It is well-known that $\phi$-mixing implies $\rho$-mixing and $\rho$-mixing implies $\alpha$-mixing
(see e.g. [15]). In fact, $\rho$-mixing is equivalent to the (SGP) and $\phi$-mixing is
equivalent to uniform ergodicity ([H]). The question we will answer is how
to compare $\rho$-mixing with $\phi$-mixing from their definitions. We want to have a
mixing definition which is equivalent to $\rho$-mixing, but comparable with $\phi$-mixing
in the same sense in which geometric ergodicity is comparable to uniform ergodicity.
During the last two sections we have seen that the (SGP) is equivalent to

\[ 1 - k_n \le C\delta^n \tag{67} \]

for some $C > 0$ and $\delta < 1$. But this is equivalent to

\PiA,nAl^J~P{A,)PiAl^J\ 

sup sup ■ , — < C(S", (68) 

fe>i Afce<T(Cfc),A-^^ea(5„+fc) P[Ak) 

for some $C > 0$, $\delta < 1$ as in (67) and

\[ A_k := \Omega_1 \times \Omega_2 \times \cdots \times \Omega_{k-1} \times A \times \Omega_{k+1} \times \cdots \quad \text{with } \Omega_i = \Omega. \]

So (68) can be seen as a suitable way to weaken (66) in such a way that an
equivalent condition to the (SGP) is obtained. Moreover, in the reversible case
the optimally chosen $\delta$ on the right-hand side in (68) equals the spectral radius
of $P - P_1$ on $L^2(\pi)$.

6 Proofs 

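Proof of Proposition 1: In (33) we considered the function

\[ f_0 = \sqrt{\pi(A)\pi(A^c)} \Big( \frac{1_A}{\pi(A)} - \frac{1_{A^c}}{\pi(A^c)} \Big), \quad A \in \mathcal{F}. \tag{69} \]

We have already seen in (34) that

\[ E_\pi f_0 = 0, \qquad \|f_0\|_2 = 1. \]

From the definition of $f_0$ we calculate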

1 r . 1 



= 7:{A)7:{A^) [ j^p^:,^AfTr{dx) + -^ / p{x,A^fn{dx) 



1 



7r(A)7r(A-) A, 



p{x,A)p{x,A^)T:{dx)\ . (70) 
We have



^ ^ p{x,A)'^Tr{dx) = / ( / p{x,Ay^TT{dx)+ I p{x, Ay^Tr{dx) 



1 



7r(A)2 
1 



{l~p{x,A'')YTl{dx)+ / p{x,AYTT{dx) 
J A" 

n{A) - 2n{A)7r{A')k{A) + [ p{x, A^' fnidx) 
p{x, AYTr{dx) 



Jensen 1 

7r(A) 



Using $k(A) = k(A^c)$ one obtains

> ^(1 - n{A)k{A){2 - k{A))). (71) 

Inserting the last three inequalities in (70) yields

ll^'/olli > 7riA^)-7riA^niA){2^k{A))+7riA)-7r{Afk{A)i2-k{A)) 
-2Tr{A)Tr{A'=)kp*p{A) 
= (1 - fc(A))2 + 2niA)niA-) ((1 - fcp.p(A)) - (1 - 

> i(l-fc(v4))2 + 2^(AV(A=)(l-fcp.p(A)). (72) 

The same calculations can be done with P" instead of P. So the claim follows. 

□ 

Now let us prove Theorem 2.

Proof: Since (5), which was established to be necessary for the existence
of a spectral gap (Proposition 1), implies (9), we already know that (9) is
necessary. It remains to show that this condition is also sufficient.
So let us assume that (9) holds. Then there exists $\varepsilon > 0$ such that



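\[ \Re(\sigma(P^n)) \le 1 - \frac{\kappa}{8}\varepsilon^2 \quad \forall n < \frac{2\pi}{\arccos(1 - \frac{\kappa}{16}\varepsilon^2)} + 1. \tag{73} \]

On the other hand, from the spectral mapping theorem we know that

\[ \sigma(P^n) = (\sigma(P))^n. \tag{74} \]

Now let $z \in \sigma(P)$. Using polar coordinates, $z$ has the representation

\[ z = z(r, \phi) = r e^{i\phi} = r\cos(\phi) + i\,r\sin(\phi), \quad r \in [0, 1],\ \phi \in [-\pi, \pi). \]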
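From (73) and (74) we obtain

\[ r^n \cos(n\phi) \le 1 - \frac{\kappa}{8}\varepsilon^2 \quad \forall n < \frac{2\pi}{\arccos(1 - \frac{\kappa}{16}\varepsilon^2)} + 1. \]

Now choose $\phi_1 = \arccos(1 - \frac{\kappa}{16}\varepsilon^2) \in (0, \pi)$. For all $\phi \in (-\phi_1, \phi_1)$ and $z(r, \phi) \in
\sigma(P)$, we obtain: $r \le 1 - \frac{\kappa}{16}\varepsilon^2$. For those $z(r, \phi)$ with $\phi \notin (-\phi_1, \phi_1)$ there
exists $n = n(\phi) \le n_0(\phi_1) := \left[\frac{2\pi}{\phi_1}\right] + 1$ such that $n\phi\ [\mathrm{mod}\ 2\pi] \in (-\phi_1, \phi_1)$. This
together with (73) yields:

\[ r \le \Big(1 - \frac{\kappa}{16}\varepsilon^2\Big)^{1 / \left( \left[ \frac{2\pi}{\phi_1} \right] + 1 \right)}. \]

□

We should now prove Lemma 1.
Proof: For $i \in \{1, 2, \ldots, n_0\}$ consider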



TriA)7riA')KiA) 



p'^{x,A'')Tr{dx) 



p(2;,dy)p"-^(y,A^)7r(dx) 



/A J A 



+ / 7:{dx) / 

J A" 

$\le (k_{n-1}(A) + k(A))\,\pi(A)\pi(A^c) \le \ldots \le n\,k_1(A)\,\pi(A)\pi(A^c).$
For the other $k_i$, $i \in \{2, \ldots, n_0 - 1\}$, we have



1 



2[log2 no] 



Since 
and 

the claim follows. 



■2[iog2no] >noVie {2,...,no-l} 
2[i°g2"o] < 



□ 



Let us present the proof of Lemma 2.

Proof: First let us prove the second statement:



— ^i— - / P*"P"l^e7r(dx) 
TT{A)Tr{A<') J A 

^ p"-{x,A)p'\x,A'')TT{dx) 

{p-{x,A^))Mdx) 



niA)niA-) Jn 
1 



1 



Jensen 
< 



n{A) n{A)7r{A'') 
1 niA") 



Tr{A) tt{A) 



1. 
The first assertion follows from

1 1 

ttJA) ~ 7r{A)TT{A-) Jn 

P Markov 1 1 

7r(A) n{A)7r{A'') 



□ 



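We should now present the proof of Lemma 3.

Proof: Without loss of generality we may assume that $\pi(A) \le \frac{1}{2}$. On
the one hand we have

\[ 1 - k_n(A) = 1 - \frac{1}{\pi(A)\pi(A^c)} \int_A p^n(x,A^c)\,\pi(dx) = \frac{1}{\pi(A)\pi(A^c)} \int_A \big(\pi(A^c) - p^n(x,A^c)\big)\,\pi(dx) \]

\[ \overset{\text{C.S.}}{\le} \frac{1}{\sqrt{\pi(A^c)}} \Big( \frac{1}{\pi(A)\pi(A^c)} \int_A \big(\pi(A^c) - p^n(x,A^c)\big)^2\,\pi(dx) \Big)^{1/2} \le \sqrt{2}\, \Big( \frac{1}{\pi(A)\pi(A^c)} \int_\Omega \big(\pi(A^c) - p^n(x,A^c)\big)^2\,\pi(dx) \Big)^{1/2}. \tag{75} \]

On the other hand we get

\[ 1 - k_{P^{*n}P^n}(A) = 1 - \frac{1}{\pi(A)\pi(A^c)} \int_\Omega p^n(x,A)\,p^n(x,A^c)\,\pi(dx) = \frac{1}{\pi(A)\pi(A^c)} \int_\Omega \big(p^n(x,A^c) - \pi(A^c)\big)^2\,\pi(dx). \tag{76} \]

Inserting (76) in (75), we obtain

\[ 1 - k_n(A) \le \sqrt{2}\,\sqrt{1 - k_{P^{*n}P^n}(A)}. \tag{77} \]

Now taking the supremum first on the right and thereafter on the left-hand side,
the claim follows.

□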